MERS exploration server

Warning: this site is under development!
Warning: this site was generated automatically from raw corpora.
The information has therefore not been validated.

Fast and sensitive taxonomic classification for metagenomics with Kaiju

Internal identifier: 001254 (Main/Exploration); previous: 001253; next: 001255

Authors: Peter Menzel [Denmark]; Kim Lee Ng [Denmark]; Anders Krogh [Denmark]

Source: Nature Communications, 2016

RBID : PMC:4833860

Abstract

Metagenomics emerged as an important field of research not only in microbial ecology but also for human health and disease, and metagenomic studies are performed on increasingly larger scales. While recent taxonomic classification programs achieve high speed by comparing genomic k-mers, they often lack sensitivity for overcoming evolutionary divergence, so that large fractions of the metagenomic reads remain unclassified. Here we present the novel metagenome classifier Kaiju, which finds maximum (in-)exact matches on the protein-level using the Burrows–Wheeler transform. We show in a genome exclusion benchmark that Kaiju classifies reads with higher sensitivity and similar precision compared with current k-mer-based classifiers, especially in genera that are underrepresented in reference databases. We also demonstrate that Kaiju classifies up to 10 times more reads in real metagenomes. Kaiju can process millions of reads per minute and can run on a standard PC. Source code and web server are available at http://kaiju.binf.ku.dk.
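The abstract names the Burrows–Wheeler transform as the basis for finding exact matches at the protein level. The following is a minimal sketch of that general idea, not Kaiju's implementation: it builds a toy index over a single short amino-acid string with naive suffix sorting and then runs the standard backward search for an exact pattern. The function names `bwt_index` and `backward_search`, the example sequence, and the `$` terminator are assumptions made for illustration.

```python
from collections import Counter


def bwt_index(text):
    """Build a toy BWT-based index (suffix array, C table, occurrence table)."""
    s = text + "$"  # '$' sorts before the letters A-Z and marks the end of the text
    # Naive suffix array: fine for a short example, far too slow for real databases.
    sa = sorted(range(len(s)), key=lambda i: s[i:])
    # The BWT is the character preceding each suffix in sorted order.
    bwt = "".join(s[i - 1] for i in sa)
    # c_table[ch] = number of characters in s that are strictly smaller than ch.
    counts = Counter(s)
    c_table, total = {}, 0
    for ch in sorted(counts):
        c_table[ch] = total
        total += counts[ch]
    # occ[ch][i] = number of occurrences of ch in bwt[:i].
    occ = {ch: [0] * (len(bwt) + 1) for ch in counts}
    for i, b in enumerate(bwt):
        for ch in occ:
            occ[ch][i + 1] = occ[ch][i] + (1 if ch == b else 0)
    return sa, c_table, occ


def backward_search(pattern, index):
    """Return the sorted start positions of exact matches of pattern."""
    sa, c_table, occ = index
    lo, hi = 0, len(sa)  # half-open interval of matching suffixes
    for ch in reversed(pattern):  # the pattern is consumed right to left
        if ch not in c_table:
            return []
        lo = c_table[ch] + occ[ch][lo]
        hi = c_table[ch] + occ[ch][hi]
        if lo >= hi:
            return []
    return sorted(sa[i] for i in range(lo, hi))


# Hypothetical protein fragment used only for this example.
index = bwt_index("MKVLAAGIVKVLA")
hits = backward_search("KVLA", index)
print(hits)
```

Each step of the backward search narrows an interval of sorted suffixes, so the cost of a lookup depends on the pattern length rather than on the size of the indexed text. A production index would replace the naive sorting and the full occurrence table with more compact structures.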


Url: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4833860
DOI: 10.1038/ncomms11257
PubMed: 27071849
PubMed Central: 4833860


Affiliations: Department of Biology, University of Copenhagen, Copenhagen 2200, Denmark (all three authors)


The document in XML format

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">Fast and sensitive taxonomic classification for metagenomics with Kaiju</title>
<author>
<name sortKey="Menzel, Peter" sort="Menzel, Peter" uniqKey="Menzel P" first="Peter" last="Menzel">Peter Menzel</name>
<affiliation wicri:level="1">
<nlm:aff id="a1">
<institution>Department of Biology, University of Copenhagen</institution>
, Copenhagen 2200,
<country>Denmark</country>
</nlm:aff>
<country xml:lang="fr">Danemark</country>
<wicri:regionArea># see nlm:aff country strict</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Ng, Kim Lee" sort="Ng, Kim Lee" uniqKey="Ng K" first="Kim Lee" last="Ng">Kim Lee Ng</name>
<affiliation wicri:level="1">
<nlm:aff id="a1">
<institution>Department of Biology, University of Copenhagen</institution>
, Copenhagen 2200,
<country>Denmark</country>
</nlm:aff>
<country xml:lang="fr">Danemark</country>
<wicri:regionArea># see nlm:aff country strict</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Krogh, Anders" sort="Krogh, Anders" uniqKey="Krogh A" first="Anders" last="Krogh">Anders Krogh</name>
<affiliation wicri:level="1">
<nlm:aff id="a1">
<institution>Department of Biology, University of Copenhagen</institution>
, Copenhagen 2200,
<country>Denmark</country>
</nlm:aff>
<country xml:lang="fr">Danemark</country>
<wicri:regionArea># see nlm:aff country strict</wicri:regionArea>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">27071849</idno>
<idno type="pmc">4833860</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4833860</idno>
<idno type="RBID">PMC:4833860</idno>
<idno type="doi">10.1038/ncomms11257</idno>
<date when="2016">2016</date>
<idno type="wicri:Area/Pmc/Corpus">000140</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">000140</idno>
<idno type="wicri:Area/Pmc/Curation">000140</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Curation">000140</idno>
<idno type="wicri:Area/Pmc/Checkpoint">000B66</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Checkpoint">000B66</idno>
<idno type="wicri:source">PubMed</idno>
<idno type="RBID">pubmed:27071849</idno>
<idno type="wicri:Area/PubMed/Corpus">001171</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">001171</idno>
<idno type="wicri:Area/PubMed/Curation">001171</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">001171</idno>
<idno type="wicri:Area/PubMed/Checkpoint">001100</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Checkpoint">001100</idno>
<idno type="wicri:Area/Ncbi/Merge">001570</idno>
<idno type="wicri:Area/Ncbi/Curation">001570</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">001570</idno>
<idno type="wicri:Area/Main/Merge">001258</idno>
<idno type="wicri:Area/Main/Curation">001254</idno>
<idno type="wicri:Area/Main/Exploration">001254</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">Fast and sensitive taxonomic classification for metagenomics with Kaiju</title>
<author>
<name sortKey="Menzel, Peter" sort="Menzel, Peter" uniqKey="Menzel P" first="Peter" last="Menzel">Peter Menzel</name>
<affiliation wicri:level="1">
<nlm:aff id="a1">
<institution>Department of Biology, University of Copenhagen</institution>
, Copenhagen 2200,
<country>Denmark</country>
</nlm:aff>
<country xml:lang="fr">Danemark</country>
<wicri:regionArea># see nlm:aff country strict</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Ng, Kim Lee" sort="Ng, Kim Lee" uniqKey="Ng K" first="Kim Lee" last="Ng">Kim Lee Ng</name>
<affiliation wicri:level="1">
<nlm:aff id="a1">
<institution>Department of Biology, University of Copenhagen</institution>
, Copenhagen 2200,
<country>Denmark</country>
</nlm:aff>
<country xml:lang="fr">Danemark</country>
<wicri:regionArea># see nlm:aff country strict</wicri:regionArea>
</affiliation>
</author>
<author>
<name sortKey="Krogh, Anders" sort="Krogh, Anders" uniqKey="Krogh A" first="Anders" last="Krogh">Anders Krogh</name>
<affiliation wicri:level="1">
<nlm:aff id="a1">
<institution>Department of Biology, University of Copenhagen</institution>
, Copenhagen 2200,
<country>Denmark</country>
</nlm:aff>
<country xml:lang="fr">Danemark</country>
<wicri:regionArea># see nlm:aff country strict</wicri:regionArea>
</affiliation>
</author>
</analytic>
<series>
<title level="j">Nature Communications</title>
<idno type="eISSN">2041-1723</idno>
<imprint>
<date when="2016">2016</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithms</term>
<term>Amino Acid Sequence</term>
<term>Animals</term>
<term>Classification</term>
<term>Humans</term>
<term>Metagenome</term>
<term>Metagenomics (classification)</term>
<term>Proteins (chemistry)</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr">
<term>Algorithmes</term>
<term>Animaux</term>
<term>Classification</term>
<term>Humains</term>
<term>Métagénome</term>
<term>Métagénomique</term>
<term>Protéines</term>
<term>Séquence d'acides aminés</term>
</keywords>
<keywords scheme="MESH" type="chemical" qualifier="chemistry" xml:lang="en">
<term>Proteins</term>
</keywords>
<keywords scheme="MESH" qualifier="classification" xml:lang="en">
<term>Metagenomics</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Algorithms</term>
<term>Amino Acid Sequence</term>
<term>Animals</term>
<term>Classification</term>
<term>Humans</term>
<term>Metagenome</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr">
<term>Algorithmes</term>
<term>Animaux</term>
<term>Classification</term>
<term>Humains</term>
<term>Métagénome</term>
<term>Métagénomique</term>
<term>Protéines</term>
<term>Séquence d'acides aminés</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<p>Metagenomics emerged as an important field of research not only in microbial ecology but also for human health and disease, and metagenomic studies are performed on increasingly larger scales. While recent taxonomic classification programs achieve high speed by comparing genomic
<italic>k</italic>
-mers, they often lack sensitivity for overcoming evolutionary divergence, so that large fractions of the metagenomic reads remain unclassified. Here we present the novel metagenome classifier Kaiju, which finds maximum (in-)exact matches on the protein-level using the Burrows–Wheeler transform. We show in a genome exclusion benchmark that Kaiju classifies reads with higher sensitivity and similar precision compared with current
<italic>k</italic>
-mer-based classifiers, especially in genera that are underrepresented in reference databases. We also demonstrate that Kaiju classifies up to 10 times more reads in real metagenomes. Kaiju can process millions of reads per minute and can run on a standard PC. Source code and web server are available at
<ext-link ext-link-type="uri" xlink:href="http://kaiju.binf.ku.dk">http://kaiju.binf.ku.dk</ext-link>
.</p>
</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Danemark</li>
</country>
</list>
<tree>
<country name="Danemark">
<noRegion>
<name sortKey="Menzel, Peter" sort="Menzel, Peter" uniqKey="Menzel P" first="Peter" last="Menzel">Peter Menzel</name>
</noRegion>
<name sortKey="Krogh, Anders" sort="Krogh, Anders" uniqKey="Krogh A" first="Anders" last="Krogh">Anders Krogh</name>
<name sortKey="Ng, Kim Lee" sort="Ng, Kim Lee" uniqKey="Ng K" first="Kim Lee" last="Ng">Kim Lee Ng</name>
</country>
</tree>
</affiliations>
</record>
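Outside Dilib, the identifiers in a record like the one above can be read with a general-purpose XML library. The record as displayed uses the prefixes `wicri:`, `nlm:` and `xlink:` without declaring them, which a namespace-aware parser would likely reject. The sketch below works around this by wrapping the fragment in an element that binds the prefixes. The wrapper element, the placeholder URIs for `wicri` and `nlm`, the helper names, and the shortened sample record are all assumptions made for illustration.

```python
import xml.etree.ElementTree as ET

# The record does not declare these prefixes, so the URIs for wicri and nlm
# are placeholders chosen only to satisfy the parser (an assumption).
PLACEHOLDER_NS = {
    "wicri": "urn:example:wicri",
    "nlm": "urn:example:nlm",
    "xlink": "http://www.w3.org/1999/xlink",
}


def parse_record(fragment):
    """Parse a record fragment by wrapping it in an element that binds the prefixes."""
    decls = " ".join(f'xmlns:{p}="{u}"' for p, u in PLACEHOLDER_NS.items())
    return ET.fromstring(f"<wrapper {decls}>{fragment}</wrapper>")


def identifiers(root):
    """Return the first value found for each of the pmid, pmc, doi and url idno types."""
    wanted = {"pmid", "pmc", "doi", "url"}
    found = {}
    for idno in root.iter("idno"):
        kind = idno.get("type")
        if kind in wanted and kind not in found:
            found[kind] = (idno.text or "").strip()
    return found


# Shortened copy of the record, kept small for the example.
SAMPLE = """<record><TEI><teiHeader><fileDesc>
<titleStmt>
<title xml:lang="en">Fast and sensitive taxonomic classification for metagenomics with Kaiju</title>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">27071849</idno>
<idno type="pmc">4833860</idno>
<idno type="doi">10.1038/ncomms11257</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus">000140</idno>
</publicationStmt>
</fileDesc></teiHeader></TEI></record>"""

root = parse_record(SAMPLE)
ids = identifiers(root)
title = root.find(".//title").text
print(title)
print(ids)
```

Binding the prefixes in a wrapper leaves the record text itself untouched, which seems preferable to stripping the prefixes with string replacement. If the real export declares its own namespaces, the wrapper is unnecessary.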

To manipulate this document under Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001254 | SxmlIndent | more

Or

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001254 | SxmlIndent | more

To link to this page within the Wicri network

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     PMC:4833860
   |texte=   Fast and sensitive taxonomic classification for metagenomics with Kaiju
}}

To generate wiki pages

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:27071849" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1 

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021